Learning-Based Facial Animation

نویسنده

  • Robert Bargmann
چکیده

This thesis proposes a novel approach for automated 3D speech animation from audio. An end-to-end system is presented which undergoes three principal phases. In the acquisition phase, dynamic articulation motions are recorded and amended. The learning phase studies the correlation of these motions in their phonetic context in order to understand the visual nature of speech. Finally, for the synthesis phase, an algorithm is proposed that carries as much of the natural behavior as possible from the acquired data to the final animation. The selection of motion segments for the synthesis of animations relies on a novel similarity measure, based on a Locally Linear Embedding representation of visemes, which closely relates to viseme categories defined in articulatory phonetics literature. This measure offers a relaxed selection of visemes, without reducing the quality of the animation. Along with a general hierarchical substitution procedure which can directly be reused in other speech animation systems, our algorithm performs optimum segment concatenation in order to create new utterances with natural coarticulation effects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High-resolution Animation of Facial Dynamics

This paper presents a framework for performance-based animation and retargeting of high-resolution face models from motion capture. A novel method is introduced for learning a mapping between sparse 3D motion capture markers and dense high-resolution 3D scans of face shape and appearance. A high-resolution facial expression space is learnt from a set of 3D face scans as a person specific morpha...

متن کامل

Subtle Facial Animation Transfer from 2D Videos to 3D Faces with Laplacian Deformation

Realistic facial animation transfer from one individual to others has been a persistent challenge. In this paper, we present an effective method that transfers facial animation from 2D videos onto 3D faces in a visually pleasing manner. Our method is based on a Laplacian deformation framework. We represent the facial animation with the displacements of a set of feature points. By the assumption...

متن کامل

Learning and synthesizing MPEG-4 compatible 3-D face animation from video sequence

In this paper, we present a new system that applies an example-based learning method to learn facial motion patterns from a video sequence of individual facial behavior such as lip motion and facial expressions, and using that to create vivid threedimensional (3-D) face animation according to the definition of MPEG-4 face animation parameters. The system consists of three key modules, face trac...

متن کامل

A Machine Learning Approach to Automate Facial Expressions from Physical Activity

We propose a novel approach based on machine learning to simulate facial expressions related to physical activity. Because of the various factors they involve, such as psychological and biomechanical, facial expressions are complex to model. While facial performance capture provides the best results, it is costly and difficult to use for real-time interaction during intense physical activity. A...

متن کامل

Vision Based Speech Animation Transferring with Underlying Anatomical Structure

We present a novel method to transfer speech animation recorded in low resolution videos onto realistic 3D facial models. Unsupervised learning is utilized on a speech video corpus to find underlying manifold of facial configurations. K-means clustering is applied on the low dimensional space to find key speaking-related facial shapes. With a small set of laser scanner captured 3D models relate...

متن کامل

Expression transfer for facial sketch animation

This paper presents a hierarchical animation method for transferring facial expressions extracted from a performance video to different facial sketches. Without any expression example obtained from target faces, our approach can transfer expressions by motion retargetting to facial sketches. However, in practical applications, the image noise in each frame will reduce the feature extraction acc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012